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ALZHEIMER'S DISEASE THERAPEUTICS 
The field of the invention is Alzheimer's disease 
therapeutics . 
5 Background of the Invention 

Alzheimer's disease (AD) is a progressive 
degenerative disorder of the brain that afflicts over 
four million people in the United States. No effective 
treatment is available. The most characteristic change 

10 observed upon post-mortem histopathological analysis of 
AD-afflicted brain tissue is the presence of neuritic and 
cerebrovascular plaques containing dense deposits of p- 
amyloid protein (Selkoe, Cell 58:611-612, 1989). 0- 
amyloid is a 39-43 amino acid peptide (Glenner and Wong, 

15 biochem. biophys. Res. Commun. 120:885-890, 1984; Masters 
et al., Proc. Natl. Acad. Aci. USA 82:4345-4249, 1985) 
synthesized as part of a larger precursor protein 
referred to as amyloid precursor protein (APP) , which is 
known to have a number of isoforms in humans (APP 695 , Kang 

20 et al., Nature 325:733-736, 1987; APP 751 , Ponte et al., 
Nature 331:525-527, 1988, and Tanzi et al., Nature 
331:528-530, 1988; and APP 770 , Kitaguchi et al. , Nature 
331:530-532, 1988). The amino terminal of 0-amyloid is 
generated by cleavage of a peptide bond of APP which in 

25 APP 695 lies between Met596 and Asp597. 

Although structural alterations of APP are 
implicated in the pathogenesis of Alzheimer's disease, it 
remains unknown how they cause the disease. No 
biological function for APP has been identified, although 

30 there is evidence that APP has a receptor-like 

architecture (Kang et al., Nature 325:733-736, 1987; 
Ponte et al., Nature 331:525-527, 1988; Tanzi et al., 
Nature 331:528-530, 1988; Kitaguchi et al., Nature 
331:530-532, 1988), is located n the neuronal surface 

35 (Dyrks et al., EMBO J. 7:949-957, 1988), and possesses an 
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evolutionarily conserved cytoplasmic domain (Yamada et 
al. , Biochem. Biophys. Res. Commun. 149:665-671, 1987). 

Summary of the Invention 
The methods and therapeutical compositions of the 
5 invention are based upon the discovery, described in 
detail below, that APP forms a complex with G Q , a major 
GTP-binding protein (or "G protein") in brain. Like all 
G proteins, a molecule of G Q is made up of one a subunit 
and one Py subunit. Two isoforms of G Q , known as G ol (or 

10 g oa) and G o2 ( or g ob) ' have been identified; they have 
slight amino acid differences in their a subunits, and 
are together referred to herein as G Q . The cDNA sequence 
and deduced amino acid sequence of the a subunits of each 
of G o1 and G o2 (as reported by Strathmann et al., Proc. 

15 Natl. Acad. Sci. USA 87:6477-6481, 1990) are shown in 
Fig. 4a (SEQ ID NO: 2) and Fig. 4b (SEQ ID NO: 28), 
respectively. 

The finding that APP associates with G Q is 
consistent with related findings concerning other 

20 G proteins, as disclosed in a second application 

(USSN ) having the same inventor and filing 

date as the present application, which second application 
is herein incorporated by reference. The cytoplasmic 
APP 695 sequence His 657 -Lys 676 (SEQ ID NO: 1) possesses a 

25 specific G Q -activating function, and is necessary for 
complex formation of this APP with G Q ; this sequence, 
sometimes referred to as the "couplone" region of APP, is 
completely conserved in APP 751 and APP 770 , as well as in 
mouse APP 695 . This provides evidence that APP is a 

30 receptor coupled to G Q , and suggests that abnormal APP-G Q 
signalling is involved in the Alzheimer's disease 
process . 
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The invention includes a method of identifying a 
therapeutic useful for treating or preventing Alzheimer's 
disease, which method includes the steps of 

contacting (a) a first molecule containing the 
5 couplone portion of APP (SEQ ID NO: 1) with (b) a second 
molecule containing the amino acid sequence of G Q (SEQ ID 
NO: 2) or an APP-associating region of G Q (SEQ ID NOs: 3, 
4, or 5), in the presence of a candidate compound; and 
either (i) determining whether the candidate 

10 compound interferes with (i.e., inhibits partially or 
completely) the association of the first and second 
molecules, or (ii) determining whether the candidate 
compound interferes with the activation of the second 
molecule by the first molecule, such interference being 

15 an indication that the candidate compound is a potential 
therapeutic useful for treating or preventing Alzheimer's 
disease. The determining step may be accomplished by, 
for example, immmunoprecipitating the first molecule with 
an antibody specific for APP, and detecting the presence 

20 or amount of the second molecule which co-precipitates 
with the first molecule. Alternatively, the second 
molecule can be immunoprecipitated with an antibody 
specific for G 0 , following which the presence or amount 
of the first molecule which co-precipitates with the 

25 second molecule is determined. Where activation is the 
criterion being measured, the determination step may be 
accomplished by contacting the second molecule with a 
substrate which is or includes GTP or an analog of GTP 
[such as GTPyS or Gpp(NH)p], and detecting or measuring 

30 the binding of the substrate to the second molecule, 
wherein such binding is evidence of activation of the 
second molecule by the first molecule. In preferred 
embodiments, the contacting step is carried out in a 
cell-free system; the Mg 2+ concentration at which the 

35 contacting step is carried out is between approximately 
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lxlCT 7 and lxlCT 2 M r and the first molecule includes the 
cytoplasmic tail portion of APP 695 from residues 649 to 
695 (SEQ ID NO: 6) and/or the membrane-spanning portion 
of APP 695 from residues 639 to 648 (SEQ ID NO: 7) (the 
5 entire membrane-spanning segment of APP 695 being from 
/residues 625 to 648, SEQ ID NO: 8); the first molecule 
more preferably includes substantially all of APP (SEQ ID 
NO: 9). (Alternatively, the corresponding functional 
regions of APP 751 or APP 770 , or any other APP, may be 

10 used.) The second molecule preferably contains two or 
three of the putative APP-associating regions referred to 
above, and may also contain one or more of the GTP- 
binding regions of G Q , corresponding to residues 35 to 50 
(SEQ ID NO: 10), residues 201 to 218 (SEQ ID NO: 29), or 

15 residues 263 to 274 (SEQ ID NO: 30) of G ol [Kaziro, 

"Structure of the genes coding for the a subunits of G 
proteins", Ch. 1 in ADP-ribos vlatina Toxins and G 
proteins (Moss, J., and Vaughan, M. eds.) ppl89-206, 
American society for Microbiology, Washington, D.C. 

20 (1988)], and more preferably contains substantially all 
Of G Q (SEQ ID NO: 2) . 

The invention also includes a system (e.g., a 
cell-free in vitro system) for screening candidate 
Alzheimer's disease therapeutics, which system includes a 

25 first polypeptide containing a sequence essentially 
identical to that of peptide 20 (SEQ ID NO: 1) , and a 
second polypeptide containing a sequence essentially 
identical to one, two or three of the putative APP- 
associating regions of G Q (SEQ ID NOs: 3, 4, and 5); the 

30 system may also include a means for detecting either (a) 
the association of the first polypeptide with the second 
polypeptide, or (b) the activation of the second 
polypeptide by the first polypeptide. The first 
polypeptide may conveniently be anchored t a solid 

35 material (e.g., a cellular membrane, a polystyrene 
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surface, or a standard matrix material), or may be in a 
phospholipid vesicle. It may include a sequence 
essentially identical to the membrane-spanning region of 
APP, and/or a sequence essentially identical to the 
5 entire cytoplasmic tail of APP. The second molecule 
preferably contains the GTP-binding domain of G Q , and 
more preferably contains the entire sequence of G Q . 

The invention also features a method for 
diminishing the activation of G Q in a neuronal cell by 

10 treating the cell with a compound, such as a peptide 
fragment of G D or of the cytoplasmic tail of APP, which 
blocks association of neuronal G c with, and/ or activation 
of neuronal G Q by, the cytoplasmic tail of APP. The cell 
may be so treated in vivo (i.e., in an animal, e.g. a 

15 mammal such as a human or other primate, cow, horse, pig, 
sheep, goat, dog, cat, rat, mouse, guinea pig, hamster, 
or rabbit) or in vitro. This method may be used to 
prevent or treat the symptoms of Alzheimer's disease in a 
patient. Such a compound may include, for example, a 

20 peptide having fewer than 50 amino acids (preferably 40 
or fewer, and more preferably 30 or fewer) , and 
containing the sequence of peptide 20. Also within the 
invention is a DNA molecule (e.g., a plasmid or viral 
DNA) encoding such a peptide, and a therapeutic 

25 composition containing, in a pharmaceutically acceptable 
carrier, either the peptide or the DNA molecule. 

In another aspect, the invention features a method 
for identifying a ligand for which APP is a receptor, 
which method includes the steps of 

30 providing an APP molecule, the cytoplasmic tail of 

which is accessible to a molecule of G Q ; 

contacting a candidate compound with the 
extracellular domain of the APP molecule; and 

detecting either (a) association of G Q with the 

35 APP molecule, (b) dissociation of G Q from the APP 
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molecule, or (c) activation of G 0 by the APP molecule, 
such association, dissociation, or activation being 
evidence that the candidate compound is a ligand of APP. 
Other features and advantages of the invention 
5 will be apparent from the detailed description set forth 
below, and from the claims. 

Brief D escription of the Drawings 
Fig. 1(a) is a schematic diagram illustrating the 
structural organization of APP. The hatched box contains 
10 the sequence of the /3/A 4 protein; the black box contains 
the so-called "Peptide 20" or couplone sequence; filled 
circles are N-glycosylation sites. The numbers designate 
amino acid sequence numbers corresponding to APP 695 . 

Fig. l(b) is a bar graph illustrating the effects 
15 of synthetic APP peptides on G e . In (b) , (d) , (e) and 
(f), values represent the mean ±S.E. of three 
experiments . 

Fig. i(c) is a graph illustrating the time course 
of the action of peptide 20 on G D . Values represent the 
20 mean of three experiments. Since the S.E. was < 5% of 
each value in this figure, the error bars are not 
indicated. 

Fig. l(d) is a graph illustrating the effects of 
peptide 20 variants on G Q . 
25 Fig. l(e) is a graph illustrating the effect 

linkage with a transmembrane region has on the action of 
peptide 20 on G Q . 

Fig. l(f) is a graph illustrating the effect of 
pertussis toxin on peptide 20-induced stimulation of GTP- 
30 yS binding to G Q . 

Figs. 2a-2d is a set of SDS-PAGE gels analyzed by 
immunoblotting, which illustrate the immunoprecipitation 
of APP and G D by an anti-APP antibody from brain 
membranes. (a) Immunopr cipitation of APP by 22C11. 
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(b) Immunoprecipitation of G D by 22C11. (c) Effect of 
Mg 2 * on the immunoprecipitation of G Q by 22C11. 

(d) Effect of peptide 20 on 22Cll-induced precipitation 
of G oa (left) and APP (right) . Each of the results 
5 presented in this figure was reproduced at least three 
times . 

Fig. 3a is a schematic diagram of the construction 
method used to prepare recombinant mutant APP cDNAs • 
Regions labeled ATG, TAA, TGA signify original 
10 translation and termination sites and a newly inserted 
termination site, respectively. 

Fig. 3b is a schematic diagram comparing the 
structures of authentic APP 695 and the two recombinant 
mutant APP polypeptides, AN and AC. 
15 Fig. 3c is an immunoblot analysis of Sf9 membranes 

using anti-Alz 90, 1C1, and 4G5. 

Fig. 3d is an immunoblot analysis of the 22C11- 
precipitate from an Sf9 membrane-G 0 reconstitution 
mixture. 

20 Fig. 3e is an immunoblot illustrating dissociation 

of G Q from APP by activation of G G . Each of the results 
presented in Figs. 3c-e was reproduced at least three 
times . 

Fig. 4a is the cDNA sequence and deduced amino 
25 acid sequence of G ol a (Strathmann et al., Proc. Natl. 
Acad. Sci. USA 87:6477-6481, 1990) (SEQ ID NO: 2). 

Fig. 4b is the cDNA sequence and deduced amino 
acid sequence of G o2 a (Strathmann et al.) (SEQ ID NO: 28). 

Detailed Description 
3 0 It was previously shown that the insulin-like 

growth factor II receptor (IGF-IIR) couples directly to 
the G protein referred to as G L (Nishimoto et al., J. 
Biol. Chem. 264:14029-14038, 1989) via a 14-residue 
section of the cytoplasmic tail of IGF-IIR, Arg 2410 -Lys 2423 
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(Okamoto et al., Cell 62:709-717, 1990; Okamoto et al., 
Proc. Natl. Acad* Sci. U.S.A. 88:8020-8023, 1991). The 
structural determinants for the G^activating function in 
IGF-IIR were defined as (i) two basic residues at the N- 
5 terminal region of the amino acid sequence, and (ii) a C— 
terminal motif of B-B-X-B or B-B-X-X-B (where B is a 
basic residue and X is a non-basic residue) (Okamoto et 
al., Cell 62:709-717, 1990). To assess whether APP might 
function as a G protein-coupled receptor, the amino acid 

10 sequence of human APP695 was examined for regions of less 
than 26 residues which satisfy (i) and (ii) . The 
sequence His 657 -Lys 676 is the only such region in the 
cytoplasmic domain of APP695. In two other isoforms of 
APP, APP751 (Ponte et al., Nature 331:525-527, 1988; Tanzi 

15 et al., Nature 331:528-530, 1988) and APP770 (Kitaguchi et 
al., Nature 331:530-532, 1988), as well as in mouse APP695 
(Yamada et al., Biochem. Biophys. Res. Commun. 149:665- 
671, 1987), this sequence is completely conserved. 

Preparation of peptides 

20 A peptide corresponding to the His 657 -Lys 676 region 

Of APP [ HHGWEVDAAVTPEERHLSK (SEQ ID NO: 1)] was 
synthesized and purified by standard methods using solid 
phase synthesis; this peptide is referred to as 
"peptide 20". Similarly prepared were peptides 

25 corresponding to other regions of APP 695 : 
APP(l-lO), MLPGLALLLL (SEQ ID NO: 11); 
APP(597-606) , DAEFRHDSGY (SEQ ID NO: 12); 
APP (677-695) , MQQNGYENPTYKFFEQMQN (SEQ ID NO: 13); 
and APP(639-648) , TVIVITLVML (SEQ ID NO: 7), a portion of 

30 the transmembrane region of APP; 

as well as the following variants of peptide 20: 
HGWEVDAAVTPEERHLSK (H-deleted, SEQ ID NO: 14); 
GWEVDAAVTPEERHLSK (HH-deleted, SEQ ID NO: 15) ; 
HHGWEVDAAVTPEE (RHLSK-deleted, SEQ ID NO: 16) ; 
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KQYTSIHHGWEVDAAVTPEERHLSK (KQYTSI-added, SEQ ID NO: 17); 
and TVIVITLVML HHGWEVDAAVTPEERHLSK (transmembrane region- 
connected peptide 20; SEQ ID NO: 18). 
Peptides were purified by HPLC to greater than 95% 

5 purity, and were used immediately after synthesis. 

Materials and Methods. 

Triineric G 0 was purified to homogeneity from 
bovine brain as described (Katada et al., FEBS Lett. 
213:353-358, 1987) • This G G preparation was stored in 20 

10 mM Hepes/NaOH (pH 7.4), 1 mM EDTA, and 0.7% CHAPS, and 
diluted £ 10 fold for assays. G i3ct , which was used in 
combination with 1.5-fold concentrated G0y (Okamoto et 
al., Natl. Acad. Sci. U.S.A. 88:8020-8023, 1991), was 
prepared as described by Morishita et al., Biochim. 

15 Biophys. Acta 161:1280-1285, 1989. Low molecular weight 
G proteins were prepared as described by Matsui et al . , 
J. Biol- Chem. 263:11071-4, 1988; G0y was purified from 
bovine brain as set forth in Katada et al., FEBS Lett. 
213:353-358, 1987. 

20 GTPyS binding to G D was assayed in a buffer 

containing 50 mM Hepes/NaOH (pH 7.4), 100 /xM EDTA, 120 jiM 
MgCl 2 , and 60 nM [ 35 S]GTPyS (DuPont-New England Nuclear) 
at 37 °c, and the fraction of total G Q bound to GTPyS was 
measured as described (Okamoto et al., Cell 62:709-717, 

25 1990). GTPyS binding to peptides was negligible. The 
total amount of G Q in a given preparation was defined as 
the saturation amount of GTPyS bound to 6 Q following a 
30-min incubation of G c with 10 mM Mg 2+ and > 60 nM GTPyS 
at 30°C. 

30 Reconstitution of G c into phospholipid vesicles 

was accomplished with 1 mg/ml of phosphatidylcholine, 
using the gel filtration method (Nishimoto et al., J. 
Biol. Chem. 264:14029-14038, 1989). In a final 
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incubation for GTPyS binding, 5 nM of reconstituted G Q 
was used. 

For experiments exploring the effect of Mg 2+ , the 
Mg 2+ concentration was set by using Mg-EDTA buffer 
5 (Birnbaumer et al. , J. Eur. J, Bicchem. 136:107=112, 
1983). 

Bovine brain membranes, prepared as described 
(Katada et al., FEBS Lett. 213:353-358, 1987) and 
suspended in buffer A [10 mM Hepes/NaOH (pH 7.4) , 1 mM 

10 EDTA, 10 mM acetic acid, and 250 mM sucrose, plus a 

mixture (termed "PAL") of 2 mM PMSF, 20 /xg/ml aprotinin, 
and 20 yM leupeptin] , were centrifuged and the pellet was 
solubilized for 1 h at 4°C in buffer B (10 mM Hepes/NaOH 
(ph 7.4), 1 mM EDTA, 120 mM NaCl, 0.5% CHAPS, and PAL). 

15 Following centrifugation of the material at 15000 rpm for 
1 h, the supernatant (500 fig protein, unless specified) 
was incubated in buffer C (20 mM Hepes/NaOH (pH 7.4), 1 
mM EDTA, 120 mM NaCl, and PAL) and 2% BSA with 22C11- 
coated protein G-Sepharose, which had been prepared by 

20 incubating protein G-Sepharose (Pharmacia) with anti-APP 
monoclonal antibody 22C11 (Boehringer Mannheim) for 1 h 
at 4°C. An antibody concentration of > 2 /xg /ml was found 
to saturate precipitation of APP and G Q , so 2 nq/ml was 
the concentration used for immunoprecipitation studies. 

25 As a control, 2 jug/*l of rabbit IgG was used. After 
overnight shaking at 4°C, the immunoprecipitated sample 
was centrifuged at 5000 rpm for 5 min. The pellet was 
washed three times with ice-cold buffer C and the final 
pellet was applied to SDS-PAGE. Electroblotting onto a 

30 PVDF sheet was performed as described (Okamoto et al., J. 
Biol. Chem. 266:1085-1091, 1991). After blocking with 
PBS containing 2% skim milk and 1% BSA, the sheet was 
incubated with the first antibody [1 fig/nl of 22C11; 
1/iooo dilution of anti-G Q a monoclonal antibody GC/2 

35 (DuPont-New England Nuclear) ; 1/1000 dilution of 1C1, a 
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monoclonal antibody against the C-terminal peptide 677-695 
of APP 695 ] for 4 h, and then exposed to horseradish 
peroxidase-conjugated goat IgG reactive for mouse or 
rabbit immunoglobulins for 2-4 h at room temperature. 
5 The antigenic bands were detected with an ECL detection 
kit (Amersham) . YL1/2 (SERA Lab) , an anti-tubulin 
antibody, was used at 1:500 dilution for immunodetection. 

Effects of .synthetic AFP peptides on G proteins. 

In the experiment shown in Fig. 1(b) , 10 nM G G was 

10 incubated with water or 100 iM of each peptide for 2 min, 
and the amount of GTPyS bound to G Q at the end of this 
period was measured. In the experiment shown in Pig. 
1(c), lOnM G Q was incubated with water (O) or 100 /iM 
peptide 20 (SEQ ID NO: 1) (•) , and GTPyS binding was 

15 measured at the indicated times. From Fig. 1(d), it can 
be seen that peptide 20 (SEQ ID NO: 1) stimulated the 
rate constant of GTPyS binding to G Q in a dose-dependent 
manner, whereas Fig. 1(b) shows that peptides from other 
regions of APP695 were ineffective. GTPyS binding to G Q 

20 in the presence or absence of peptide 20 (SEQ ID NO: 1) 
obeyed first-order kinetics according to the equation 

In [fBr-B;/Br]=-* app t 
(B is the binding at time t; Bt is the total binding 
observable at infinite time; and Jc app is the rate constant 

25 for GTPyS binding). The ability of peptide 20 (SEQ ID 
NO: 1) to activate G Q was gradually decreased during 
storage at either -4°C or -20°C. 

Studies using structural variant peptides suggest 
that both the N-terminal basic residues and the C- 

30 terminal B-B-X-X-B motif play essential roles in the G c - 
activating function of peptide 20 (SEQ ID NO: 1) [Fig. 
1(d)]. In this experiment, 10 nM G Q was incubated with 
various concentrations of HHGWEVDAAVTPEERHLSK (peptide 
20, SEQ ID NO: 1; □) , HGWEVDAAVTPEERHLSK (H-deleted, SEQ 
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ID NO: 14; 0) , GWEVDAAVTPEERHLSK (HH-deleted, SEQ ID 
NO: 15; □) , HHGWEVDAAVTPEE (RHLSK-deleted, SEQ ID 
NO: 16; ♦ ), or KQYTSIHHGWEVDAAVTPEERHLSK (KQYTSI-added, 
SEQ ID NO: 17; I) , and GTPyS binding to G Q at 2 min. was 
5 measured. Fig. 1(d) indicates which aspects of primary 
structure determine the G 0 -activator function of peptide 
20 (SEQ ID NO: 1) . Deletion of either one or both of the 
N-terminal His residues nullified G 0 -activator function 
of the peptide. The peptide (SEQ ID NO: 16) in which the 

10 c-terminal five residues of peptide 20 (SEQ ID NO: 1) has 
been deleted is several times less potent than peptide 20 
(SEQ ID NO: 1). 

As illustrated in Fig. 1(e), G Q reconstituted in 
phospholipid vesicles was incubated with transmembrane 

15 region-connected peptide 20 

( TVIVITLVML HHGWEVDAAVTPEERHLSK . SEQ ID NO: 18; □) or the 
partial sequence of the APP transmembrane domain alone 
(TVIVITLVML, SEQ ID NO: 7; □) . Transmembrane region- 
connected peptide 20 (SEQ ID NO: 18) was also incubated 

20 with G Q in the absence of phospholipids and the presence 
of 0.07% CHAPS (♦). The transmembrane region-connected 
peptide 20 (SEQ ID NO: 18) stimulated G Q reconstituted in 
phospholipid vesicles with a potency 10 times greater 
than that of peptide 20 (SEQ ID NO: 1). The 

25 transmembrane region alone (SEQ ID NO: 7) was without 
effect on G Q . In the absence of phospholipids, 
transmembrane region-connected peptide 20 (SEQ ID NO: 18) 
showed an effect on G Q no more potent than peptide 20 
(SEQ ID NO: 1) . Therefore, the stimulatory action of 

30 this transmembrane region-connected peptide (SEQ ID 
NO: 18) is attributed to the peptide 20 (SEQ ID NO: 1) 
sequence; the potentiating effect of the transmembrane 
region may be exerted by interactions with phospholipids. 
In the experiment shown in Fig. 1(f) , ADP- 

35 ribosylation f G D was accomplished by incubating G c 
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reconstituted in phospholipid vesicles with 10 fig/ml 
preactivated pertussis toxin in the presence of 10/iM NAD 
for 15 min at 30°C as described (Okamoto et al,, Cell 
62:709-717, 1990). Preactivation of pertussis toxin 
5 (Funakoshi, Japan) was carried out by treating the toxin 
with 100 jM ATP and 1 mM DTT for 10 min at 30°C. 
Reconstitution of G Q into phospholipid vesicles was 
accomplished with 1 mg/ml phosphatidylcholine (Sigman, P- 
5638) at a final G Q concentration of 50.2 nM in a buffer 

10 containing 20 mM Hepes/NaOH (pH 7.4), 0.1 mM EDTA, 1 mM 
DTT, and 100 mM NaCl by the gel filtration method 
(Nishimoto et al., J. Biol. Chem. 264:14029-14038, 1989). 
In a final incubation for GTPyS binding, 5 nM of 
reconstituted G Q was used. Increasing concentrations of 

15 peptide 20 (SEQ ID NO: 1) were incubated for 2 min with 
G Q reconstituted in phospholipid vesicles which had been 
treated with pertussis toxin in the presence (♦) or 
absence (□) of NAD, and GTPyS binding to G D was measured. 

Although peptide 20 (SEQ ID NO: 1) produced 2-3 

20 fold stimulation of GTPyS binding to G Q in the mid-range 
of Mg 2 + concentrations, the effect of peptide 20 (SEQ ID 
NO: 1) could not be observed at low (^ 100 nM) or high (> 
10 mM) Mg 2+ concentrations. 

Peptide 20 (SEQ ID NO: 1) had little effect on G 

25 proteins other than G Q : G i3L , G i2 , G i3 , G s , c-Ki-ras p21 
and stag p25A were not stimulated by this peptide (data 
not shown). Thus, peptide 20 (SEQ ID NO: 1) activates G c 
in a receptor- like manner, suggesting that APP interacts 
directly with G Q through the peptide 20 (SEQ ID NO: 1) 

30 region. 

Coprecipitation of APP and G Q 

In an effort to determine whether APP is linked to 
G G in a native membrane environment, the coprecipitation 
studies shown in Fig. 2a were performed. Solubilized 

35 membranes of bovine brain were first immunoprecipitated 
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by monoclonal anti-APP antibody 22C11, and the 
immunoprecipitate was then probed by immunodetection with 
22C11 (Lane 2) or 1C1, a monoclonal antibody against the 
C-terminal peptide 677 _ 695 of APP (SEQ ID NO: 13; Lane 4). 
5 Lanes 1 and 3 of Fig. 2a indicate the controls in which 
either no solubilized membranes were included (Lane 1) , 
or rabbit IgG was used for the precipitation step instead 
of antibody 22C11 (Lane 3). In each control, 
immunodetection was performed with 22C11. The 55-kDa and 

10 25-kDa bands seen in Lanes 1 and 2 may be heavy and light 
chains of the 22C11 used for precipitation, which reacted 
with an anti-mouse IgG antibody during immunodetection. 
The precipitate by control rabbit IgG contained no 
detectable APP. Although the 100 kD molecular size of 

15 APP appears here to be slightly less than the 110-130 kD 
reported (Weidemann et al., Cell 57:115-126, 1989), the 
precipitated form is unlikely to be an extracellular 
fragment of APP, because 1C1 recognizes this 100-kDa 
band . 

20 In the experiment illustrated in Fig. 2b, 

coprecipitation of various G proteins with APP was 
investigated. Bovine brain membrane preparations were 
immunoprecipitated with 22C11; the immunoprecipitated 
proteins were subjected to SDS-PAGE and immunoblotted 

25 with the indicated anti-G protein antisera (1/1000 
dilution). Lane 2: GC/2, anti-G Q o antiserum; lane 3: 
GC/2 plus 1 Mg/ml of purified G D ; lane 4: GA/1, common Ga 
antiserum; lane 5: AS/7, anti-Gia antiserum; lane 6: 
MS/1, common G0 antiserum. Lane 1 shows a control 

30 immunoblot with GC/2, in which a buffer solution rather 
than the bovine brain membrane preparation was 
immunoprecipitated with 22C11. Lane 7 indicates 
immunoblotting with GC/2 of the precipitate resulting 
from immunoprecipitation of brain membranes with control 

35 rabbit IgG, rather than 22C11. Th identity of the 39- 
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kDa protein in lane 2 as G Q was verified by its absence 
in the non-membrane control (lane 1); by its staining 
with another G 0 a-specific antibody, aGOl (Morishita et 
al., Eur. J. Biochem. 174:7-94, 1988) (data not shown); 
5 and by a diminution of staining of this band in the 
presence of excess soluble G Q (lane 3) . The 22C11- 
precipitate also contained immunoreactivity of Gf3 in a 
doublet at 35-36-kDa (lane 6) . The 22Cll-precipitate did 
not react with an anti-Gid antibody AS/ 7 (lane 5) . The 

10 antibody GA/1 detected only a 39-kDa band in the 22C11- 
precipitate (lane 4) . The control rabbit IgG 
immunoprecipitate did not produce anti-G Q -immunoreactive 
bands corresponding to either APP or G Q (lane 7) ■. These 
experiments indicate that the 22Cll-precipitate from 

15 brain membranes contains APP immunoreactivity at 100 kDa, 
G Q a immunoreactivity at 39 kDa, and Gp immunoreactivity 
in a doublet at 35-36 kDa, but no detectable 
immunoreactivity indicating the presence of G L a or other 
heterotrimeric G proteins. A tubulin antibody, YL1/2, 

20 did not stain the 22Cll-precipitate (data not shown) . 

In the experiment shown in Fig. 2c, the effect 
of Mg 2+ concentration on co-precipitation of G Q with anti- 
APP antibody was studied. 100 ng of solubilized brain 
membranes were precipitated by 22C11 in the presence of 

25 various Mg 2+ concentrations controlled with Mg-EDTA buffer 
(Bimbaumer et al., J. Eur. J. Biochem. 136:107-112, 
1983). The precipitates were analyzed by immunoblotting 
with GC/2. The control lane indicates the results of 
precipitation of brain membranes by rabbit IgG followed 

30 by immunodetection with GC/2. In the absence of Mg 2+ , G Q 
was less efficiently co-precipitated by 22C11. Mg 2+ 
concentrations between 1 fM and 1 mM resulted in maximal 
immunoprecipitation of G D . At concentrations > 10 mM, 
relatively little G 0 was precipitated. In contrast, 

35 immun precipitation of APP by 22C11 was not affected by 


WO 94/19692 


PCT/US94/01712 


- 16 - 

Mg 2+ concentration (data not shown) . These results 
indicate that, while Mg 2+ is not absolutely required for 
complex formation by APP and G Q , the concentration of Mg 2+ 
does strongly influence complex formation, A mid range 

5 of Mg 2+ concentration was found to facilitate A?P-G Q 
association. 

Fig. 2d illustrates the results of an experiment 
indicating that peptide 20 (SEQ ID NO: 1) prevents the 
22Cll-mediated co-precipitation of G Q , whereas it did not 

10 affect the precipitation of APP by 22C11. In contrast, a 
control peptide (SEQ ID NO: 13) representing a segment of 
APP different from that represented by peptide 20 (SEQ ID 
NO: 1) had no discernable effect on 22Cll-mediated co- 
precipitation of G Q . In this experiment; solubilized 

15 brain membranes were incubated with 2 2 CI 1 -coated beads in 
the presence of 10 peptide 20 (SEQ ID NO: 1; 2nd and 
5th lanes) or 10 /xM of the control peptide, peptide 677 _ 695 
of APP (SEQ ID NO: 13; 3rd and 6th lanes), or in the 
absence of both of these peptides (1st and 4th lanes) . 

20 In this experiment, an anti-mouse IgG antibody different 
from that used in (a) was employed. 

Precipitation of G Q reconstituted with recombinant APP- 
antibody complex 

A baculovirus DNA encoding full-length APP 695 (SEQ 
25 ID NO: 9) was prepared as outlined in Fig. 3a. Authentic 
mouse APP 6g5 cDNA (SEQ ID NO: 9) was provided by Dr. 
Yoshiyuki Sakaki (University of Tokyo, Japan) (Yamada et 
al., Biochem. Biophys. Res. Commun. 149:665-671, 1987) in 
the vector pUC18. The Hindlll-BamHI fragment containing 
30 the entire coding region was initially subcloned into the 
vector pBR322 (pBR-APP) . A single BamHI site was 
inserted immediately before the ATG codon of the Hindlll- 
Sphl fragment. This BamHI site was inserted to permit 
efficient expression of the encoded APP protein in 
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baculovirus-infected cells. The BamHI site-inserted 
APP 695 -coding DNA (BamHI-APP 695 ) was constructed from the 
Hindlll-SphI fragment and pBR-APP, utilizing their 
internal Kpnl sites, and subcloned into pUC18. By using 
5 BamHI-APP 695 as template, two truncation mutants were 
generated and subcloned into pUC18. These mutants 
possess an insertion of two TGA codons immediately before 
(aN) or after (aC) the peptide 20 sequence. Each BamHI- 
BamHI fragment of these respective APP-variation-encoding 

10 pUC18 plasmids was inserted into the baculovirus 

transfer/ expression vector pVL1393 (Invitrogen) . The 
entire region that had been through a single-stranded 
intermediate was sequenced to confirm the absence of 
unwanted nucleotide changes. New insertions were 

15 generated by oligonucleotide-directed mutagenesis with a 
kit (Takara) by the method of Kunkel et al. (Meth. 
Enzymol. 154:367-382, 1987). For the insertion of a 
BamHI site, a restriction fragment encoding the ATG start 
codon was subcloned into the vector M13mpl8 and a single 

20 stranded template was generated. An oligonucleotide 
primer (CCACGCAGGATCACGGGATCCATGCTGCCCAGCTTG; SEQ ID 
NQ: 19) was used to introduce GGATCC (SEQ ID NO: 20) 
immediately before the start codon. Following primer 
extension, the phage was used to transform E. coli strain 

25 JM109. Plaques were selected and single stranded DNA was 
sequenced. A restriction fragment containing the mutated 
region was subcloned into pBR-APP. For the insertion of 
the stop codons, oligonucleotide primers 

[CAGTACACATCCATCTGATGACATCATGGCGTGGTG (SEQ ID NO: 21) and 
30 CGCCATCTCTCCAGTGATGAATGCAGCAGAACGGA (SEQ ID NO: 22)] and 
the M13mpl9 vector were used to introduce two sequential 
TGA stop codons. Using the method of Summers and Smith 
(Summers et al., Tex. Agric. Exp. Stn. Bull. 1555, 1987), 
baculoviruses incorp rating these APP cDNAs were 
35 generated using selection by immunoblot analysis with 
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22C11, and recovered by infecting Sf9 cells (Invitrogen) . 
Four days after treatment of Sf9 cells with the viruses, 
cells were homogenized and suspended in buffer A. After 
the solubilization of the pellet with buffer B, the 
5 supernatant (100 ng) was mixed overnight with 22C11- 

coated protein G-Sepharose in buffer C plus 2% BSA at 4°C 
on a shaker- After centrifugation, the precipitated 
beads were incubated with purified G Q (1 jig) in buffer C 
supplemented with 1.1 mM MgCl2 and 2% BSA for 8-24 h at 

10 4°C on a shaker. After washing four times with ice-cold 
buffer C, the centrifugation precipitate was subjected to 
SDS-PAGE, electroblotting, and immunodetection with the 
first antibodies (1 tig/ml of 22C11; 10 /xg/ml of ariti-Alz 
90; 1/1000 dilution of 1C1; 1/500 dilution of 4G5; 0.1 

15 Mg/ml of aGOl) and the second goat anti-mouse or anti- 
rabbit IgGs conjugated with HRP. (Immunodetection of 1C1 
and 4G5, both of which are mouse IgM (k) , was 
accomplished using as second antibody a mixture of HRP- 
conjugated anti-rabbit IgG, rabbit anti-mouse IgM and 

20 rabbit anti-mouse k antibodies.) The 
three AFP constructs prepared as described above are 
compared in the schematic diagram of Fig. 3b. The 
polypeptides encoded by all three constructs retain the 
entire transmembrane and extracellular domains of APP; 

25 while AN (SEQ ID NO: 23) lacks all of the peptide 20 

residues as well as the sequence on the carboxy terminal 
side of the peptide 20 region, AC (SEQ ID NO: 24) retains 
the peptide 20 sequence and is missing only the latter 
sequence • 

30 Sf9 cells were infected, using standard methods, 

by recombinant baculoviruses encoding full length APP 695 
CDNA (SEQ ID NO: 9), APP 1 . 656 cDNA (aN; SEQ ID NO: 23), or 

^^1-676 cDNA < aC ' se Q id N0: 24 )- In uninfected Sf9 
cells, no immunoreactivity for anti-APP or anti-G c 
35 antibodies was detected (data not shown) . The membranes 
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of Sf 9 cells infected with the baculoviruses encoding 
APP 695 (SEQ ID NO: 9), aN (SEQ ID NO: 23) , and aC (SEQ ID 
NO: 24) genes (referred to as Sf9-APP 695 , Sf9-AN, and Sf9- 
aC, respectively) were found to express, respectively, 
5 130- , 120- and 130-kDa proteins reactive with antibody 
22C11 (Fig, 3d, right side) . The Sf 9-APP 695 cells 
expressed APP at « 0.1% of the total membrane protein. 
When the membranes of the three types of infected cells 
were immunoprecipitated with antibody Anti-Alz 90 

10 (Boehringer Mannheim) , a mouse monoclonal antibody 

specific for an epitope corresponding to to residues 551- 
608 of APP (SEQ ID NO: 25; a section of APP that is 
within the extracellular domain), 130-kDa, 120-kDa, and 
130-kDa proteins were recognized in Sf9-APP 695 , Sf9-AN, 

15 and Sf9-AC cells, respectively (Fig. 3c, top panel). 
Membranes from ail three types of infected cells showed 
approximately equivalent reactivity to the antibody, 
indicating that at least this portion of the 
extracellular domain was intact on each of the three and 

20 that all three cell types express approximately equal 
amounts of recombinant protein. When the antibody used 
was 1C1, a mouse monoclonal prepared against a peptide 
corresponding to residues 677-695 of APP (SEQ ID NO: 13), 
only Sf9-APP 695 membranes were reactive, indicating that 

25 the region corresponding to the C -terminal portion of the 
cytoplasmic domain is missing from both AN (SEQ ID 
NO: 23) and AC (SEQ ID NO: 24) (Fig. 3c, middle panel). 
When the antibody used was 4G5, a mouse monoclonal 
antibody raised against a peptide corresponding to 

30 residues 657-676 of APP (SEQ ID NO: l; the peptide 20 
region of the cytoplasmic domain) , 130 kDa bands from 
both Sf9-APP 695 and Sf9-AC membranes reacted with the 
antibody, but Sf9-AN membranes did not, a demonstration 
that AN (SEQ ID NO: 23) but not AC (SEQ ID NO: 24) lacks 

35 the peptide 20 region of APP (Fig. 3c, bottom panel). 
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These experiments clearly indicate that the expressed 
proteins are recombinant APP^ggg (SEQ ID NO: 9) # . APP 1-656 
(SEQ ID NO: 23), and APP^^ (SEQ ID NO: 24) , 
respectively, as designed. 
5 The 22Cll-precipitates from these Sf9 membranes 

expressing various forms of APP were exposed to purified 
G 0 , reprecipitated with 22C11, and subjected to 
immunoblot analysis using anti-G Q a antibody aGOl 
(Fig, 3d, left four lanes) and by 22C11 (right four 

10 lanes). aGOl (Morishita et al., Eur. J. Biochem. 
174:87-94, 1988) was provided by Dr. Tomiko Asano; 
similar results were obtained when antibody GC/2 was 
substituted. The control lanes are 22Cll-precipitate 
exposed to G Q in the absence of Sf9 membranes. 

15 Approximately 1/10-1/20 (0.05-0.1 fig/tube) of the 
reconstituted G Q was precipitated, together with a 
comparable amount («0.1 jig/ tube) of APP. Easily 
detectable amounts of G Q a were present in the final 
precipitate when G Q was mixed with 22Cll-precipitates 

20 from Sf9-AC or Sf9-APP695 membranes, but essentially no 
G Q a was found in the final precipitate from Sf 9-aN 
membranes. Thus, formation of an APP-G Q complex requires 
the peptide 20 region, residues 657-676 (SEQ ID NO: 1) . 

In the experiment illustrated in Fig. 3e, 22C11- 

25 precipitates from Sf9-APP 695 membranes (100 fig protein 

each) were incubated with activated G Q (lanes 2 and 4) or 
unactivated G 0 (lanes 1 and 3); the final precipitates 
(left panel) and supernatants (right panel) were analyzed 
by simultaneous immunoblotting with 22C11 and aGOl 

30 antibodies. Activation of G Q was carried out by 

incubating G Q in 20 mM Hepes/NaOH (pH 7.4), 1 mM EDTA, 2 
mM MgCl 2 , and 1 /iM GTPyS overnight at room temperature. 
When G Q was incubated with GTPyS, no G 0 a associated with 
the APP-22C11 complex (Fig. 3e) , suggesting that the 
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activation state of the G protein regulates APP-G D 
association. 

This study suggests that APP functions as a 
receptor coupled to G 0 through the G Q -activator 
5 cytoplasmic domain His 657 -Lys 676 (SEQ ID NO: 1). APP has 
a point mutation in at least one form of familial 
Alzheimer's disease (Goate et al., Nature 349:704-706, 
1991) . A structural alteration of APP is therefore 
thought to be one cause of Alzheimer's disease, although 

10 it remains unknown how the mutation might produce the 
disease. One novel possibility suggested by this study 
is that the cytoplasmic, C-terminal fragment of APP is 
pathogenic. It has been suggested (Abraham et al., 
Biotechnology 7:147-153, 1989; Shivers et al., EMBO J. 

15 7:1365-1370, 1988; Kametani et al., Biomedical Research 
10:179-183, 1989) that the residual C-terminal portion of 
APP may remain in the cell membrane after abnormal 
cleavage of APP to produce 0/A4 protein in Alzheimer's 
disease neurons. By analogy with the oncogenic 

20 transformation of c-erb B into v-erb B, such a structural 
alteration of APP may alter its function and prompt APP 
to constitutively activate G Q . This hypothesis is 
consistent with the study (Yanker et al., Science 
245:417-420, 1989) indicating that recombinant expression 

25 of the C-terminal 105-residue portion of APP in neuronal 
cells evokes cell death, and with the reports that G Q 
activity is linked to neuronal growth cone motility 
(Strittmatter et al., BioEssays 13:127-134, 1990), axon 
and dendrite formation (Granneman et al., J. 

30 Neurochemistry 54:1995-2001, 1990), and memory (Guillen 
et al., EMBO J. 9:1449-1455, 1990). This study suggests 
that Alzheimer's disease is a disorder of an APP-G Q 
signalling system caused by structural alterations of 
APP. 
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Example 1 

The screening method of the invention can be 
carried out as follows: 

The assay used can be a very simple cell-free 
5 assay employing a first polypeptide consisting 

essentially of the couplone, or G Q -binding portion, of 
APP (SEQ ID NO: 1) and a second polypeptide consisting 
essentially of an APP-binding portion of G Q . This APP- 
binding portion of G Q may be the 15-residue segment 

10 identified as the anticouplone portion of G D (SEQ ID 
NO: 3), or it may be one or both of the two flanking 
regions, residues 1-3 (SEQ ID NO: 4) and residues 19-36 
(SEQ ID NO: 5) of G Q . Alternatively, longer portions, or 
all, of APP and/ or G Q can be used, or the appropriate 

15 portions of APP and/or G 0 can be linked to other 
polypeptides to form hybrid polypeptides with 
characteristics (such as altered immunoreactivity or 
enzymatic activity) that would improve detection of the 
endpoint of the assay. The assay is carried out by 

20 contacting the APP-based polypeptide with the G D -based 
polypeptide in the presence of a candidate compound, in 
parallel with a control assay containing no candidate 
compound, and determining whether the candidate compound 
inhibits co- immunoprecipitation of the first and second 

25 polypeptides (using either an antibody specific for the 
first polypeptide or an antibody specific for the second 
polypeptide) • Alternatively, activation of the second 
(G Q ) polypeptide may be the measured criterion: if so, 
the second polypeptide must include the GTP-binding 

30 region of G c (SEQ ID NO: 10) , and GTP or an appropriate 
non-hydrolyzable analog thereof (such as GTPyS or 
Gpp(NH)p) must be included in the assay. The assay may 
also be carried out using phospholipid vesicles prepared 
by standard methods (e.g., as described by Nishimoto et 

35 al., J. Biol. Chem. 264:14029-14038, 1989), provided that 
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the first (APP) polypeptide includes a region of 
hydrophobic amino acids [such as all (SEQ ID NO: 8) or a 
portion (e.g., SEQ ID NO: 7) of the transmembrane region 
of APP) that permit it to be anchored in the phospholipid 
5 bilayer. Alternatively, the assay may be carried out 
using intact cells or red cell ghosts which contain APP 
and G Q , or appropriate portions thereof • The cells may 
express the first and second polypeptides naturally or by 
virtue of genetic engineering, or the polypeptides may be 
10 introduced directly into the cells or ghosts by standard 
means. 

Example 2 

The progress of Alzheimer's disease may be halted 
or reversed by treating a patient with a compound which 

15 diminishes the activation of neural G 0 by truncated APP. 
Such a compound may be identified in a screening assay as 
described above, or may consist essentially of a 
polypeptide containing the amino acid sequence of (a) the 
couplone region of APP (SEQ ID NO: 1) , (b) the 

20 anticouplone region of G Q (SEQ ID NO: 3), or (c) the APP- 
associating region(s) of G Q (SEQ ID NO: 4 and/or 5), or a 
combination of (b) and (c) . Such polypeptides may be 
produced in quantity by standard recombinant means, or by 
standard synthetic techniques. To minimize proteolytic 

25 degradation in vivo, the carboxy and amino termini may be 
derivatized (e.g., with ester or amide groups), some or 
all of the amino acids may be replaced with D-amino 
acids, or particularly sensitive peptide linkages may be 
substituted with non-peptide bonds using standard 

30 methodology. To improve penetration of the blood-brain 
barrier (BBB) , the polypeptides may be altered to 
increase lipophilicity (e.g., by ester if icat ion to a 
bulky lipophilic moiety such as cholesteryl) or t supply 
a cleavable "targetor" moi ty that enhances retention on 
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the brain side of the barrier (Bodor et al. , Science 
257:1698-1700, 1992). Alternatively, the polypeptide may 
be linked to an antibody to the transferrin receptor, in 
order to exploit that receptor's role in transporting 
5 iron across the blood-brain barrier, as taught by Friden 
et al., Science 259:373-377, 1993. It is expected that 
an intravenous dosage equivalent to approximately 1 to 
100 /xrooles of the polypeptide of the invention per kg per 
day, or an intrathecally administered dosage of 

10 approximately 0.1 to 50 /moles per kg per day, will be 
effective in blocking activation of G Q in an Alzheimer's 
patient. If the polypeptide is sufficiently protected 
from proteolytic degradation, as described above, it may 
also be administered orally in appropriately higher 

15 doses. Alternatively, the compound may be incorporated 
into a slow-release implant to ensure a relatively 
constant supply of the therapeutic to the patient's 
brain. 
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30 
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His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
1 5 10 15 

His Leu Ser Lye 
20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1910 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TGTGGCAGGG AAGGGGCCAC C ATG GGA TGT ACG CTG AGC GCA GAG GAG AGA 51 

Met Gly Cys Thr Leu Ser Ala Glu Glu Arg 
1 5 10 

GCC GCC CTC GAG CGG AGC AAG GCG ATT GAG AAA AAC CTA AAA GAA GAT 99 
Ala Ala Leu Glu Arg Ser Lys Ala lie Glu Lys Asn Leu Lys Glu Asp 
15 20 25 

GGC ATC AGC GCC GCC AAA GAC GTG AAA TTA CTC CTG CTG GGG GCT GGA 147 
Gly lie Ser Ala Ala Lys Asp Val Lys Leu Leu Leu Leu Gly Ala Gly 
30 35 40 

GAA TCA GGA AAA AGC ACC ATT GTG AAG CAG ATG AAG ATC ATC CAT GAA 195 
Glu Ser Gly Lys Ser Thr lie Val Lys Gin Met Lys lie lie His Glu 
45 50 55 

GAT GGC TTC TCT GGG GAA GAC GTG AAG CAG TAC AAG CCT GTG GTC TAC 243 
Asp Gly Phe Ser Gly Glu Asp Val Lys Gin Tyr Lys Pro Val Val Tyr 
60 65 70 

AGC AAC ACC ATC CAG TCT CTG GCG GCC ATT GTC CGG GCC ATG GAC ACT 291 
Ser Asn Thr lie Gin Ser Leu Ala Ala He Val Arg Ala Met Asp Thr 
75 80 85 90 

TTG GGC GTG GAG TAT GGT GAC AAG GAG AGG AAG ACG GAC TCC AAG ATG 339 
Leu Gly Val Glu Tyr Gly Asp Lys Glu Arg Lys Thr Asp Ser Lys Met 
95 100 105 

GTG TGT GAC GTG GTG AGT CGT ATG GAA GAC ACT GAA CCG TTC TCT GCA 387 
Val Cys Asp Val Val Ser Arg Met Glu Asp Thr Glu Pro Phe Ser Ala 
110 115 120 

GAA CTT CTT TCT GCC ATG ATG CGA CTC TGG GGC GAC TCG GGG ATC CAG 435 
Glu Leu Leu Ser Ala Met Met Arg Leu Trp Gly Asp Ser Gly He Gin 
125 130 135 

GAG TGC TTC AAC CGA TCT CGG GAG TAT CAG CTC AAT GAC TCT GCC AAA 483 
Glu Cys Phe Asn Arg Ser Arg Glu Tyr Gin Leu Asn Asp Ser Ala Lys 
140 145 150 

TAC TAC CTG GAC AGC CTG GAT CGG ATT GGA GCC GGT GAC TAC CAG CCC 531 
Tyr Tyr Leu Asp Ser Leu Asp Arg He Gly Ala Gly Asp Tyr Gin Pro 
155 160 165 " 170 
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ACT GAG CAG GAC ATC CTC CGA ACC AGA GTC AAA ACA ACT GGC ATC GTA 579 
Thr Glu Gin Asp He Leu Arg Thr Arg Val Lye Thr Thr Gly He Val 
175 180 185 

GAA ACC CAC TTC ACC TTC AAG AAC CTC CAC TTC AGG CTG TTT GAC GTC 627 
Glu Thr His Phe Thr Phe Lys Asn Leu His Phe Arg Leu Phe Asp Val 
190 195 200 

GGG GGC CAG CGA TCT GAA CGC AAG AAG TGG ATC CAC TGC TTT GAG GAT 675 
Gly Gly Gin Arg Ser Glu Arg Lys Lys Trp He His Cys Phe Glu Asp 
205 210 215 

GTC ACG GCC ATC ATC TTC TGT GTC GCA CTC AGC GGC TAT GAC CAG GTG 723 
Val Thr Ala He He Phe Cys Val Ala Leu Ser Gly Tyr Asp Gin Val 
220 225 230 

CTC CAC GAG GAC GAA ACC ACG AAC CGC ATG CAC GAG TCT CTC ATG CTC 771 
Leu His Glu Asp Glu Thr Thr Asn Arg Met His Glu Ser Leu Met Leu 
235 240 245 250 

TTC GAC TCC ATC TGT AAC AAC AAG TTT TTC ATT GAT ACC TCC ATC ATC 819 
Phe Asp Ser He Cys Asn Asn Lys Phe Phe He Asp Thr Ser He He 
255 260 265 

CTC TTC CTC AAC AAG AAA GAC CTC TTT GGC GAG AAG ATT AAG AAG TCA 867 
Leu Phe Leu Asn Lys Lys Asp Leu Phe Gly Glu Lys lie LyB Lys Ser 
270 275 280 

CCC TTG ACC ATC TGC TTT CCC GAA TAC CCA GGC TCC AAC ACC TAT GAA 915 
Pro Leu Thr He Cys Phe Pro Glu Tyr Pro Gly Ser Asn Thr Tyr Glu 
285 290 295 

GAT GCA GCT GCC TAC ATC CAA ACA CAG TTT GAA AGC AAA AAC CGC TCA 963 
Asp Ala Ala Ala Tyr He Gin Thr Gin Phe Glu Ser Lys Asn Arg Ser 
300 305 310 

CCC AAC AAA GAA ATT TAC TGT CAC ATG ACT TGT GCC ACA GAC ACG AAT 1011 
Pro Asn Lys Glu lie Tyr Cys His Met Thr Cys Ala Thr Asp Thr Asn 
315 320 325 330 

AAT ATC CAG GTG GTA TTC GAC GCC GTC ACC GAC ATC ATC ATT GCC AAC 1059 
Asn lie Gin Val Val Phe Asp Ala Val Thr Asp lie lie lie Ala Asn 
335 340 345 

AAT CTC CGG GGC TGC GGC TTG TAC TGACCTCTTG TCCTGTATAG CAACCTATTT 1113 
Asn Leu Arg Gly Cys Gly Leu Tyr 
350 


GACTGCTTCA 

TGGACTCTTT 

GCTGTTGATG 

TTGATCTCCT 

GGTAGCATGA 

CCTTTGGCCT 

1173 

TTGTAAGACA 

CACAGCCTTT 

CTGTACCAAG 

CCCCTGTCTA 

ACCTACGACC 

CCAGAGTGAC 

1233 

TGACGGCTGT 

GTATTTCTGT 

AGAATGCTGT 

AGAATACAGT 

TTTAGTTGAG 

TCTTTACATT 

1293 

TAGAACTTGA 

AAGGATTTTA 

AAAAACAAAA 

CAAAAACCAT 

TTCTCATGTG 

CTTTGTAGCT 

1353 

TTAAAAGAAA 

AAAGGAAAAC 

TCACCATTTA 

ATCCATATTT 

CCTTTTTATT 

TTGAAGTTTA 

1413 

AAAAAAAAAT 

GTCTGTACCC 

ACACCCTCCC 

CCTTCCCCAC 

CTCAGCAGAA 

CTGGGGCTGG 

1473 

CACACAGAGG 

CAGTGCTGGG 

CCTGGCGCCT 

CCCAGGGCTT 

CTGTGCAGCC 

CATGGCTGGT 

1533 

GGGAACATGT 

CAGGCTAGTC 

TGTCTAGAAG 

GCCACTGGCC 

ACTGTACCCA 

CCCTTCCCCA 

1593 
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TGCCTGTGGG CTGCCCAGAC ACCTCATATA CCACCAGGCA GTGGCAGCTC CGCCCTGCTC 1653 

AGCCATGCGA CTCCAAACAC ACTCAAAGTT TGCGTAGAAA AAGCACAGCT CTGGCAGGGG 1713 

TAGCTGCCAC AGACAACGCT CATCACCTAT AGAAATCCAG CCCTATAGAA GCAATTCACC 1773 

CAGCCCCTTC CTACACTCCC TTTGTGTTGT TAACTTTTTG GTTTTTCTGG TCCTAGTGAG 1833 

TGCCTCCCAT GCATACCTGA CCAGCTCTGC CAGTGTCTGG GGTCTGGGGA ACAGGGGTTG 1893 

TGTGGTTTGG TTTTTGG 1910 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Asp Ala Val Thr Asp lie lie lie Ala Lys Asn Leu Arg Gly Cys 
1 5 10 15 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY : 1 inear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Gly Cys 
1 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 5: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

lie Glu Lys Asn Leu Lys Glu Asp Gly lie Ser Ala Ala Lys Asp Val 
1 5 10 15 

Lys Leu 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH i 47 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Lys Lys Lys Gin Tyr Thr Ser He His His Gly Val Val Glu Val Asp 
15 10 15 

Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys Met Gin Gin Asn 
20 25 30 

Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin Met Gin Asn 
35 40 45 


(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 7: 
<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 


Thr Val He Val He Thr Leu Val Met Leu 
15 10 


(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 8: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
1 5 10 15 

He Val He Thr Leu Val Met Leu 
20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2085 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doubl 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG CTG CCC GGT TTG GCA CTG CTC CTG CTG GCC GCC TGG ACG GCT CGG 48 
Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Tip Thr Ala Arg 
1 5 10 15 

GCG CTG GAG GTA CCC ACT GAT GGT AAf OCT GQC CTG CTG GCT GAA CCC 96 
Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

CAG ATT GCC ATG TTC TGT GGC AGA CTG AAC ATG CAC ATG AAT GTC CAG 144 
Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

AAT GGG AAG TGG GAT TCA GAT CCA TCA GGG ACC AAA ACC TGC ATT GAT 192 
Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

ACC AAG GAA GGC ATC CTG CAG TAT TGC CAA GAA GTC TAC< CCT GGA CTG 240 
Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Gly Leu 
65 70 75 80 

CAG ATC ACC AAT GTG GTA GAA GCC AAC CAA CCA GTG ACC ATC CAG AAC 288 
Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

TGG TGC AAG CGG GGC CGC AAG CAG TGC AAG ACC CAT CCC CAC TTT GTG 336 
Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

ATT CCC TAC CGC TGC TTA GTT GGT GAG TTT GTA AGT GAT GCC CTT CTC 384 
He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

GTT CCT GAC AAG TGC AAA TTC TTA CAC CAG GAG AGG ATG GAT GTT TGC 432 
Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

GAA ACT CAT CTT CAC TGG CAC ACC GTC GCC AAA GAG ACA TGC AGT GAG 480 
Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

AAG AGT ACC AAC TTG CAT GAC TAC GGC ATG TTG CTG CCC TGC GGA ATT 528 
Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 
165 170 175 

GAC AAG TTC CGA GGG GTA GAG TTT GTG TGT TGC CCA CTG GCT GAA GAA 576 
Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

AGT GAC AAT GTG GAT TCT GCT GAT GCG GAG GAG GAT GAC TGC GAT GTC 624 
Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Cys Asp Val 
195 200 205 

TGG TGG GGC GGA GCA GAC ACA GAC TAT GCA GAT GGG AGT GAA GAC AAA 672 
Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 


GTA GTA GAA GTA GCA GAG GAG GAA GAA GTG GCT GAG GTG GAA GAA GAA 
Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 


720 
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GAA GCC GAT GAT GAC GAG GAC GAT GAG GAT GGT GAT GAG GTA GAG GAA 768 
Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

GAG GCT GAG GAA CCC TAC GAA GAA GCC ACA GAG AGA ACC ACC AGC ATT 816 
Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

GCC ACC ACC ACC ACC ACC ACC ACA GAG TCT GTG GAA GAG GTG GTT CGA 864 
Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

GTT CCT ACA ACA GCA GCC AGT ACC CCT GAT GCC GTT GAC AAG TAT CTC 912 
Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

GAG ACA CCT GGG GAT GAG AAT GAA CAT GCC CAT TTC CAG AAA GCC AAA 960 
Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

GAG AGG CTT GAG GCC AAG CAC CGA GAG AGA ATG TCC CAG GTC ATG AGA 1008 
Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

GAA TGG GAA GAG GCA GAA CGT CAA GCA AAG AAC TTG CCT AAA GCT GAT 1056 
Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

AAG AAG GCA GTT ATC CAG CAT TTC CAG GAG AAA GTG GAA TCT TTG GAA 1104 
Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

CAG GAA GCA GCC AAC GAG AGA CAG CAG CTG GTG GAG ACA CAC ATG GCC 1152 
Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

AGA GTG GAA GCC ATG CTC AAT GAC CGC CGC CGC CTG GCC CTG GAG AAC 1200 
Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

TAC ATC ACC GCT CTG CAG GCT GTT CCT CCT CGG CCT CGT CAC GTG TTC 1248 
Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

AAT ATG CTA AAG AAG TAT GTC CGC GCA GAA CAG AAG GAC AGA CAG CAC 1296 
Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

ACC CTG AAG CAT TTC GAG CAT GTG CGC ATG GTG GAT CCC AAG AAA GCC 1344 
Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

GCT CAG ATC CGG TCC CAG GTT ATG ACA CAC CTC CGT GTG ATT TAT GAG 1392 
Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 

CGC ATG AAT CAG TCT CTC TCC CTG CTC TAC AAC GTG CCT GCA GTG GCC 1440 
Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

GAG GAG ATT CAG GAT GAA GTT GAT GAG CTG CTT CAG AAA GAG CAA AAC 1488 
Glu Glu lie Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 
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TAT TCA GAT GAC GTC TTG GCC AAC ATG ATT AGT GAA CCA AGG ATC AGT 1536 . 

Tyr Ser Asp ABp Val Leu Ala Asn Met lie Ser Glu Pro Arg lie Ser 
500 505 510 

TAC GGA AAC GAT GCT CTC ATG CCA TCT TTG ACC GAA ACG AAA ACC ACC 1584 
Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

GTG GAG CTC CTT CCC GTG AAT GGA GAG TTC AGC CTG GAC GAT CTC CAG 1632 
Val Glu Leu Leu Pro Val ABn Gly Glu Phe Ser Leu ABp Asp Leu Gin 
530 535 540 

CCG TGG CAT TCT TTT GGG GCT GAC TCT GTG CCA GCC AAC ACA GAA AAC 1680 
Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

GAA GTT GAG CCT GTT GAT GCC CGC CCT GCT GCC GAC CGA GGA CTG ACC 1728 
Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 

ACT CGA CCA GGT TCT GGG TTG ACA AAT ATC AAG ACG GAG GAG ATC TCT 1776 
Thr Arg Pro Gly Ser Gly Leu Thr Asn lie Lys Thr Glu Glu lie Ser 
580 585 590 

GAA GTG AAG ATG GAT GCA GAA TTC CGA CAT GAC TCA GGA TAT GAA GTT 1824 
Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

CAT CAT CAA AAA TTG GTG TTC TTT GCA GAA GAT GTG GGT TCA AAC AAA 1872 
His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

GGT GCA ATC ATT GGA CTC ATG GTG GGC GGT GTT GTC ATA GCG ACA GTG 1920 
Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

ATC GTC ATC ACC TTG GTG ATG CTG AAG AAG AAA CAG TAC ACA TCC ATT 1968 
He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

CAT CAT GGT GTG GTG GAG GTT GAC GCC GCT GTC ACC CCA GAG GAG CGC 2016 
His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

CAC CTG TCC AAG ATG CAG CAG AAC GGC TAC GAA AAT CCA ACC TAC AAG 2064 
His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 

TTC TTT GAG CAG ATG CAG AAC 2085 
Phe Phe Glu Gin Met Gin Asn 
690 695 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 


WO 94/196*2 


PCT/US94/01712 


- 33 - 


Lys Leu Leu Leu Leu Gly Ala Gly Glu Ser Gly Lys Ser Thr lie Val 
1 5 10 15 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu 
15 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Asp Ala Glu Phe Arg His Asp Ser Gly Tyr 
1 5 10 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 1 

Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys Phe Phe Glu Gin 
15 10 15 

Met Gin Asn 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Hie Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg His 
1 5 10 15 


(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu 
1 5 10 .15 

Ser Lys 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu 
15 10 15 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Lys Gin Tyr Thr Ser lie His His Gly Val Val Glu Val Asp Ala Ala 
1 5 10 15 

Val Thr Pro Glu Glu Arg His Leu Ser Lys 
20 25 


(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 18: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH x 30 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Thr Val He Val He Thr Leu Val Met Leu His His Gly Val Val Glu 
1 5 10 15 

Val Asp Ala Ala Val Thr Pro Glu Glu Arg His Leu Ser Lys 
20 25 30 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CCACGCAGGA TCACGGGATC CATGCTGCCC AGCTTG 36 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GGATCC 6 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 


CAGTACACAT CCATCTGATG ACATCATGGC GTGGTG 36 
(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 22: 


WO 94/19692 


PCT/US94/01712 


- 36 - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 22: 

CGCCATCTCT CCAGTGATGA ATGCAGCAGA ACGGA 35 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met Hie Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr LyB Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Gly Leu 
65 70 75 80 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr lie Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

lie Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 

Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Cys Asp Val 
195 200 205 
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Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu hla A«p Asp Aap Glu Asp Asn Glu Ast> Glv Aso Glu Val Glu Glu 
245 * * 250 " ~ 255 

• Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 
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Pro Trp His Ser Phe Gly Ala Asp Ser VaX Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
580 585 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 60S 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 - 615 620 

Gly Ala He lie Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 676 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 24: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Gly Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 HO 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 
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Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Fro Cys Gly lie 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

ser Asp abii Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Cys Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 . 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 
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Tyr Ser Asp Asp Val Leu Ala Aan Met He Ser Glu Pro Arg He Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 ^ 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
580 585 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala He lie Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys 
675 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 

(B) TYPE: amino acid 

(C) STRAND ED NESS : 

(P) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Ala Asp Ser Val Pro Ala Asn Thr Glu Asn Glu Val Glu Pro Val Asp 
1 5 10 15 

Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr Thr Arg Pro Gly Ser Gly 
20 25 30 

Leu Thr Asn He Lys Thr Glu Glu He Ser Glu Val Lys Met Asp Ala 
.35 40 45 

Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
50 55 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 

(B) TYPE: amino acid 

{ C ) STRAKDEDNESS : 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Val lie Val lie Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser 
15 10 15 

He His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu 
20 25 30 

Arg His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr 
35 40 45 

Lys Phe Phe Glu Gin Met Gin Asn 
50 55 


(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 

(B) TYPE: amino acid 

(C) STRANDEDNESS: b ingle 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin He Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Gly Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys LyB Thr His Pro His Phe Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 


WO 94/19692 


PCT/US94/01712 


- 42 - 

Val Pro Asp Lye Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 & 160 

Lvs Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Cys Asp Val 
195 200 205 

Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 39O 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 


Ala Gin lie Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 
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Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lye Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
580 585 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Ash Lys 
610 615 620 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

His His Gly Val Val Glu Val ABp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 

His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 

Phe Phe Glu Gin Met Gin Asn 
690 695 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2274 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GCTGTGGCAG GGAAGGGGCC ACC ATG GGA TGT ACG CTG AGC GCA GAG GAG 50 

Met Gly Cys Thr Leu Ser Ala Glu Glu 
1 5 

AGA GCC GCC CTC GAG CGG AGC AAG GCG ATT GAG AAA AAC CTC AAA GAA 98 
Arg Ala Ala Leu Glu Arg Ser Lys Ala He lu Lys Asn Leu Lys Glu 
10 15 20 25 


WO 94/19692 


PCT/US94/01712 


- 44 - 

GAT GGC ATC AGC GCC GCC AAA OAC GTG AAA TTA CTC CTG CTG GGG GCT 146 
Asp Gly lie Ser Ala Ala Lye Asp Val Lys Leu Leu Leu Leu Gly Ala 
30 35 40 

GGA GAA TCA GGA AAA AGC ACC ATT GTG AAG CAG ATG AAG ATC ATC CAT 194 
Gly Glu Ser Gly Lys Ser Thr lie Val Lys Gin Met Lys He He His 
45 50 55 

GAA GAT GGC TTC TCT GGG GAA GAC GTG AAG CAG TAC AAG CCT GTG GTC 242 
Glu Asp Gly Phe Ser Gly Glu Asp Val Lys Gin Tyr Lys Pro Val Val 
60 65 70 

TAC AGC AAC ACC ATC CAG TCT CTG GCG GCC ATT GTC CGG GCC ATG GAC 290 
Tyr Ser Asn Thr He Gin Ser Leu Ala Ala He Val Arg Ala Met Asp 
75 80 85 

ACT TTG GGC GTG GAG TAT GGT GAC AAG GAG AGG AAG ACG GAC TCC AAG 338 
Thr Leu Gly Val Glu Tyr Gly Asp Lys Glu Arg Lys Thr Asp Ser Lys 
90 95 100 105 

ATG GTG TGT GAC GTG GTG AGT CGT ATG GAA GAC ACT GAA CCG TTC TCT 386 
Met Val Cys Asp Val Val Ser Arg Met Glu Asp Thr Glu Pro Phe Ser 
110 115 120 

GCA GAA CTT CTT TCT GCC ATG ATG CGA CTC TGG GGC GAC TCG GGG ATC * 434 
Ala Glu Leu Leu Ser Ala Met Met Arg Leu Trp Gly Asp Ser Gly He 
125 130 135 

CAG GAG TGC TTC AAC CGA TCT CGG GAG TAT CAG CTC AAT GAC TCT GCC 482 
Gin Glu Cys Phe Asn Arg Ser Arg Glu Tyr Gin Leu Asn Asp Ser Ala 
140 145 150 

AAA TAC TAC CTG GAC AGC CTG GAT CGG ATT GGA GCC GGT GAC TAC CAG 530 
Lys Tyr Tyr Leu Asp Ser Leu Asp Arg He Gly Ala Gly Asp Tyr Gin 
155 160 165 

CCC ACT GAG CAG GAC ATC CTC CGA ACC AGA GTC AAA ACA ACT GGC ATC 578 
Pro Thr Glu Gin Asp He Leu Arg Thr Arg Val Lys Thr Thr Gly He 
170 ' 175 180 185 

GTA GAA ACC CAC TTC ACC TTC AAG AAC CTC CAC TTC AGG CTG TTT GAC 626 
Val Glu Thr His Phe Thr Phe Lys Asn Leu His Phe Arg Leu Phe Asp 
190 195 200 

GTC GGG GGC CAG CGA TCT GAA CGC AAG AAG TGG ATC CAC TGC TTT GAG 674 
Val Gly Gly Gin Arg Ser Glu Arg Lys Lys Trp He His Cys Phe Glu 
205 ^ 210 ~ 215 

GAT GTC ACG GCC ATC ATC TTC TGT GTC GCA CTC AGC GGC TAT GAC CAG 722 
Asp Val Thr Ala He He Phe Cys Val Ala Leu Ser Gly Tyr Asp Gin 
220 225 230 

GTG CTC CAC GAG GAC GAA ACC ACG AAC CGC ATG CAC GAA TCC CTG AAG 770 
Val Leu His Glu Asp Glu Thr Thr Asn Arg Met His Glu Ser Leu Lys 
235 240 245 

CTC TTC GAC AGC ATC TGC AAC AAC AAG TGG TTC ACA GAC ACA TCT ATT 818 
Leu Phe Asp Ser He CyB Asn Asn Lys Trp Phe Thr Asp Thr Ser He 
250 ~ 255 260 265 

ATC CTG TTT CTC AAC AAG AAG GAC ATA TTT GAG GAG AAG ATC AAG AAG 866 
He Leu Ph Leu Asn Lys Lys Asp He Phe Glu Glu Lys He Lys Lys 
270 275 280 
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TCC CCA CTC ACC ATC TGC TTT CCT GAA TAC ACA CGC CCC AGT GCC TTC 914 
Ser Pro Leu Thr He Cys Phe Pro Glu Tyr Thr Gly Pro Ser Ala Phe 
285 " 290 295 

ACA GAA GCT GTG GCT CAC ATC CAA GGG CAG TAT GAG AGT AAG AAT AAG 962 
Thr Glu Ala Val Ala His He Gin Gly Gin Tyr Glu Ser Lys Asn Lys 

300 305 310 

TCA GCT CAC AAG GAA GTC TAC AGC CAT GTC ACC TGT GCC ACG GAC ACC 1010 
Ser Ala His Lys Glu Val Tyr Ser His Val Thr Cys Ala Thr Asp Thr 
315 320 325 

AAC AAC ATC CAA TTC GTC TTT GAT GCC GTG ACA GAT GTC ATC ATC GCC 1058 
Asn Asn He Gin Phe Val Phe Asp Ala Val Thr Asp Val He He Ala 
330 335 340 345 

AAA AAC CTA CGG GGC TGT GGA CTC TAC TGAGCCCTGG CCTCCTACCC 1105 
Lys Asn Leu Arg Gly Cys Gly Leu Tyr 
350 

AGCCTGCCAC TCACTCCTCC CCTGGACCCA GAGCTCTGTC ACTGCTCAGA TGCCCTGTTA 1165 

ACTGAAGAAA ACCTGGAGGC TAGCCTTGGG GGCAGGAGGA GGCATCCTTT GAGCATCCCC 1225 

ACCCCACCCA ACTTCAGCCT CGTGACACGT GGGAACAGGG TTGGGCAGAG GTGTGGAACA 1285 

GCACAAGGCC AGAGACCACG GCATGCCACT TGGGTGCTGC TCACTGGTCA GCTGTGTGTC 1345 

TTACACAGAG GCCGAGTGGG CAACACTGCC ATCTGATTCA GAATGGGCAT GCCCTGTCCT 1405 

CTGTACCTCT TGTTCAGTGT CCTGGTTTCT CTTCCACCTT GGTGATAGGA TGGCTGGCAG 1465 

GAAGGCCCCA TGGAAGGTGC TGCTTGATTA GGGGATAGTC GATGGCATCT CTCAGCAGTC 1525 

CTCAGGGTCT GTTTGGTAGA GGGTGGTTTC GTCGACAAAA GCCAACATGG AATCAGGCCA 1585 

CTTTTGGGGC QCAAAGACTC AGACTTTGGG GACGGGTTCC CTCCTCCTTC ACTTTGGATC 1645 

TTGGCCCCTC TCTGGTCATC TTCCCTTGCC CTTGGGCTCC CCAGGATACT CAGCCCTGAC 1705 

TCCCATGGGG TTGGGAATAT TCCTTAAGAC TGGCTGACTG CAAAGGTCAC CGATGGAGAA 1765 

ACATCCCTGT GCTACAGAAT TGGGGGTGGG ACAGCTGAGG GGGCAGGCGG CTCTTTCCTG 1825 

ATAGTTGATG ACAAGCCCTG AGAATGCCAT CTGCTGGCTC CACTCACACG GGCTCAACTG 1885 

TCCTGGGTGA TAGTGACTTG CCAGGCCACA GGCTGCAGGT CACAGACAGA GCAGGCAAGC 1945 

AGCCTTGCAA CTGCAGATTA CTTAGGGAGA AGCATCCTAG CCCCAGCTAA CTTTGGACAG 2005 

TCAGCATATG TCCCTGCCAT CCCTAGACAT CTCCAGTCAG CTGGTATCAC AGCCAGTGGT 2065 

TCAGACAGGT TTGAATGCTC ATGTGGCAGG GGGCCCGGTA CCCAGCTTTT GTTCCCTTTA 2125 

GTGAGGGTTA ATTGCGCGCT TGGGCTAATC ATGGTCATAG CTGTTGGGCG TTGCTGGCGT 2185 

TTTTCCATAG GCTCCGCCCC CTGACGAGAT CACAAAAATC GACGCTCAAG TCAGAGGTGG 2245 

CGAAACCGAC AGACTATAAG ATACCAGGC 2274 
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(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 29: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 

(B) TYPE: amino acid 

(C) STRAND ED NESS ! 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Asp Val Gly Gly Gin Arg Ser Glu Arg Lys Lys Trp He His Cys Phe 
15 10 15 

Glu Asp 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 


Thr Ser He He Leu Phe Leu Asn Lys Lys Asp Leu 
1 5 10 
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CLAIMS 

1, A method of identifying a therapeutic useful 
for treating or preventing the symptoms of Alzheimer's 
disease, which method includes the steps of 
5 contacting (a) a first molecule comprising the 

couplone portion (SEQ ID NO: 1) of amyloid precursor 
protein (APP) with (b) a second molecule comprising an 
APP-associating region of G 0 (SEQ ID NOs: 3, 4, or 5) , in 
the presence of a candidate compound; and 
10 determining whether said candidate compound 

interferes with the association of said first and second 
molecules, said interference being an indication that 
said candidate compound is a therapeutic useful for 
treating Alzheimer's disease. 

15 2. The method of claim 1, wherein said 

determining step is accomplished by 

immmunoprecipitating said first molecule with an 
antibody specific for APP; and 

detecting the presence or amount of said second 
20 molecule which co-precipitates with said first molecule. 

3. The method of claim 1, wherein said 
determining step is accomplished by 

immunoprecipitating said second molecule with an 
antibody specific for G 0 ; and 
25 detecting the presence or amount of said first 

molecule which co-precipitates with said second molecule. 


4. The method of claim 1, wherein said first 
molecule comprises the portion of APP 695 from residues 649 
to 695 (SEQ ID NO: 6). 
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5. The method of claim 1, wherein said first 
molecule comprises the portion of APP 695 from residues 639 
to 648 (SEQ ID NO: 7) . 

6. The method of claim 1, wherein said first 

5 molecule comprises the portion of APP 695 from residues 640 
to 695 (SEQ ID NO: 26) . 

7. The method of claim 6, wherein said first 
molecule comprises essentially all of APP 695 (SEQ ID 
NO: 27). 

10 8. The method of claim 1, wherein said second 

molecule comprises the GTP-binding region of G Q (SEQ ID 
NO: 10) . 

9 . The method of claim 8 , wherein said second 
molecule comprises essentially all of G D (SEQ ID NO: 2) . 

15 10. A method of assaying for a therapeutic useful 

for treating Alzheimer's disease, which method includes 

the steps of 

contacting (a) a first molecule comprising the 

couplone region of APP (SEQ ID NO: 1) with (b) a second 
20 molecule comprising an APP-associating region of G Q (SEQ 

ID NO: 3, 4, or 5), in the presence of a candidate 

compound; and 

determining whether said candidate compound 

interferes with the activation of said second molecule by 
25 said first molecule, said interference being an 

indication that said candidate compound is a therapeutic 

useful for treating Alzheimer's disease. 

11. The method of claim 10, wherein said 
determining step is acc mplished by 


WO 94/19692 


PCT/US94/01712 


- 49 - 

contacting said second molecule with a substrat 
comprising GTP or an analog of GTP; and 

detecting or measuring the binding of said 
substrate to said second molecule, wherein said binding 
5 is evidence of said activation of said second molecule by 
said first molecule. 

12. The method of claim 1, wherein said 
contacting step is carried out at a Mg 2+ concentration 
between lxlO" 7 and lxlO" 2 M. 

10 13. The method of claim 10, wherein said 

contacting step is carried out at a Mg 2+ concentration 
between lxlO" 7 and lxl 0" 2 M. 

14. The method of claim 1, wherein said 
contacting step is carried out in a cell-free system. 

15 15. The method of claim 10 , wherein said 

contacting step is carried out in a cell-free system. 

16. A system for screening candidate Alzheimer's 
disease therapeutics, which system comprises 

a first polypeptide comprising a sequence 
20 essentially identical to that of peptide 20 (SEQ ID 
NO: 1); 

a second polypeptide comprising a sequence 
essentially identical to the anticouplone sequence of G Q 
(SEQ ID NO: 3) ; and 
25 a means for detecting either (a) the association 

of said first polypeptide with said second polypeptide, 
or (b) the activation of said second polypeptide by said 
first polypeptide. 
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17. A cell-fre system for screening candidate 
Alzheimer's disease therapeutics, which system comprises 

a first polypeptide comprising a sequence 
essentially identical to that- of peptide 20 (SEQ ID 

5 NO: l) ; and 

a second polypeptide comprising a sequence 
essentially identical to the anticouplone sequence of G Q 
(SEQ ID NO: 3). 

18. The system of claim 17, wherein said first 
10 polypeptide is anchored to a solid material or is in a 

phospholipid vesicle. 

19. The system of claim 17, wherein said second 
polypeptide further comprises residues 1 to 3 (SEQ ID 
NO: 4) and 19 to 36 (SEQ ID NO: 5) of G c . 

15 20. The system of claim 19, wherein said second 

polypeptide comprises G 0 1 or G Q 2. 

21. A method for diminishing the activation of G D 
in a neuronal cell by treating the cell with a compound 
which blocks association of G D with the cytoplasmic tail 

20 of APP. 

22. The method of claim 21, wherein the compound 
is a peptide fragment of G Q or of the cytoplasmic tail of 
APP. 

23. The method of claim 21, wherein said cell is 
25 within an animal. 

24. The method of claim 23, wherein said animal 
is a human. 
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25. A method for preventing or treating 
Alzheimer's disease in a patient, comprising treating the 
patient with a compound which blocks association of G Q 
with the cytoplasmic tail of APP. 

5 26. A method for preventing or treating 

Alzheimer's disease in a patient, comprising treating the 
patient with a compound which inhibits activation of 
neuronal G Q by the cytoplasmic tail of APP. 

27. A peptide having less than 50 amino acids and 
10 comprising the sequence of peptide 20 (SEQ ID NO: 1). 

28. A therapeutic composition comprising the 
peptide of claim 27 and a pharmaceutically acceptable 
carrier . 

29. A method for identifying a ligand for which 
15 APP is a receptor, which method includes the steps of 

providing an APP molecule and a G Q molecule; 
contacting a candidate compound with the 
extracellular domain of said APP molecule, the 
cytoplasmic tail of said APP molecule being accessible to 
20 said G 0 molecule, and 

detecting either (a) association of said G 0 
molecule with said APP molecule, or (b) activation of 
said G Q molecule by said APP molecule, said association 
or activation being evidence that said candidate compound 
25 is a ligand of APP. 
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